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FIELD OF THE INVENTION 

The present invention relates to polypeptides to be administered especially to humans and 
5 in particular for therapeutic use. The polypeptides are modified polypeptides whereby the 
modification results in a reduced propensity for the polypeptide to elicit an immune 
response upon administration to the hmnan subject. The invention in particular relates to 
the modification of bryodin 1 to result in bryodin 1 proteins that are substantially non- 
immunogenic or less inmiunogenic than any non-modified counterpart v/hen used in vivo. 
10 The invention relates furthermore to T-cell epitope peptides derived from said non- 
modified protein by means of which it is possible to create modified bryodin 1 variants 
with reduced immunogenicity. 



BACKGROUND OF THE INVENTION 

15 There are many instances whereby the efficacy of a therapeutic protein is limited by an 
unwanted immune reaction to the therapeutic protein. Several mouse monoclonal 
antibodies have shown promise as therapies in a number of human disease settings but in 
certain cases have failed due to the induction of significant degrees of a human anti- 
murine antibody (HAMA) response [Schroff, R. W. et al (1985) Cancer Res. 45: 879-885; 

20 Shawler, D.L. et al (1985) J. Immunol 135: 1530-1535], For monoclonal antibodies, a 
number of techniques have been developed in attempt to reduce the HAMA response 
[WO 89/09622; EP 0239400; BP 0438310; WO 91/06667]. These recombinant DNA 
approaches have generally reduced the mouse genetic information in the final antibody 
constract whilst increasing the human genetic information in the final construct. 

25 Notwithstanding, the resultant "humanised" antibodies have, in several cases, still elicited 
an immune response in patients [Issacs J.D. (1990) Sent. Immunol. 2: 449, 456; Rebello, 
P.R. etal (1999) Transplantation 6^: 1417-1420]. 

Antibodies are not the only class of polypeptide molecule administered as a therapeutic 
30 agent against which an immune response may be movinted. Even proteins of human 

origin and with the same amino acid sequences as occur within humans can still induce an 
inunune response in humans. Notable examples include the therapeutic use of 
granulocyte-niacrophage colony stimulating factor [Wadhwa, M. et al (1999) Clin. 
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Cancer Res. 5: 1353-1361] and interferon alpha 2 [Russo, D, et al (1996) Bri, J. Haem. 
94: 300-305;. Stein, R. et al (1988) New Engl J. Med. 318: 1409-1413] amongst others. 

A principal factor in the induction of an immune response is the presence within the 
5 protein of peptides that can stimulate the activity of T-cells via presentation on MHC 
class n molecules, so-called "T-cell epitopes". Such potential T-cell epitopes are 
commonly defined as any amino acid residue sequence with the ability to bind to MHC 
Class n molecules. Such T-cell epitopes can be measured to establish MHC binding. 
Implicitly, a "T-cell epitope" means an epitope which when bound to MHC molecules can 
10 be recognized by a T-cell receptor (TCR), and which can, at least in principle, cause the 
activation of these T-cells by engaging a TCR to promote a T-cell response. It is, 
however, usually understood that certain peptides which are found to bind to MHC Class 
n molecules may be retained in a protein sequence because such peptides are recognized 
as "self within the organism into which the final protein is administered, 

15 

It is known, that certain of these T-cell epitope peptides can be released during the 
degradation of peptides, polypeptides or proteins within cells and subsequently be 
presented by molecules of the major histocompatability complex (MHC) in order to 
trigger the activation of T-cells. For peptides presented by MHC Class n, such activation 
20 of T-cells can then give rise, for example, to an antibody response by direct stimulation of 
B-cells to produce such antibodies. 

MHC Class n molecules are a group of highly polymorphic proteins which play a central 
role in helper T-cell selection and activation. The human leukocyte antigen group DR 

25 (HLA-DR) are the predominant isotype of this group of proteins and are the major focus 
of the present invention. However, isotypes HLA-DQ and HLA-DP perform similar 
functions, hence the present invention is equally applicable to these. The MHC class n 
DR molecule is made of an alpha and a beta chain which insert at their C-termini through 
the cell membrane. Each hetero-dimer possesses a ligand binding domain which binds to 

30 peptides varying between 9 and 20 andno acids in length, although the binding groove 
can accommodate a maximtun of 11 amino acids. The ligand binding domain is 
comprised of anodno acids 1 to 85 of the alpha chain, and amino acids 1 to 94 of the beta 
chain. DQ molecules have recently been shown to have an homologous structure and the 
DP family proteins are also expected to be very similar. In humans approximately 70 
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different allotypes of the DR isotype are known, for DQ there are 30 different allotypes 
and for DP 47 different allotypes are known. Each individual bears two to four DR 
alleles, two DQ and two DP alleles. The structure of a number of DR molecules has been 
solved and such structures point to an open-ended peptide binding groove with a number 

5 . of hydrophobic pockets which engage hydrophobic residues (pocket residues) of the 
peptide [Brown et al Nature (1993) 364: 33; Stem et al (1994) Nature 368: 215]. 
Polymorphism identifying the different allotypes of class n molecule contributes to a 
wide diversity of different binding surfaces for peptides within the peptide binding grove 
and at the population level ensures maximal flexibility with regard to the ability to 

10 recognize foreign proteins and mount an inmiune response to pathogenic organisms. 
There is a considerable amount of polymorphism within the Ugand binding domain with 
distinct "families" within different geographical populations and ethnic groups. This 
polymorphism affects the binding characteristics of the peptide binding domain, thus 
different "families" of DR molecules will have specificities for peptides with different 

15 sequence properties, although there may be some overlap. This specificity determines 
recognition of Th-cell epitopes (Class n T-cell response) which are ultimately responsible 
for driving the antibody response to B-cell epitopes present on the same protein from 
which the Th-cell epitope is derived. Thus, the immune response to a protein in an 
individual is heavily influenced by T-cell epitope recognition which is a function of the 

20 p^tide binding specificity of that individual's HLA-DR allotype. Therefore, in order to 
identify T-cell epitopes within a protein or peptide in the context of a global population, it 
is desirable to consider the binding properties of as diverse a set of HLA-DR allotypes as 
possible, thus covering as high a percentage of the world population as possible. 

25 An immune response to a therapeutic protein proceeds via the MHC class II peptide 
presentation pathway. Here exogenous proteins are engulfed and processed for 
presentation in association with MHC class II molecules of the DR, DQ or DP type. 
MHC Class n molecules are expressed by professional antigen presenting cells (APCs), 
such as macrophages and dendritic cells amongst others. Engagement of a MHC class II 

30 peptide complex by a cognate T-cell receptor on the surface of the T-cell, together with 
the cross-binding of certain other co-receptors such as the CD4 molecule, can induce an 
activated state within the T-cell. Activation leads to the release of cytokines further 
activating other lymphocytes such as B cells to produce antibodies or activating T kiUer 
cells as a full cellular immune response. 
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The abibty of a peptide to bind a given MHC class n molecule for presentation on the 
surface of an APC is dependent on a number of factors most notably its primary 
sequence. This will influence both its propensity for proteolytic cleavage and also its 
affinity for binding within the peptide binding cleft of the MHC class 11 molecule. The 
5 MHC class n / peptide complex on the APC surface presents a binding face to a particular 
T-cell receptor (TCR) able to recognize determinants provided both by exposed residues 
of the peptide and the MHC class n molecule. 

In the art there are procedures for identifying synthetic peptides able to bind MHC class II 
10 molecules (e.g. W098/52976 and WOOO/34317). Such peptides may not function as T- 
cell epitopes in all situations, particularly, in vivo due to the processing pathways or other 
phenomena. T-cell epitope identification is the first step to epitope elimination. The 
identification and removal of potential T-cell epitopes from proteins has been previously 
disclosed. In the art methods have been provided to enable the detection of T-cell epitopes 
. 15 usually by computational means scanning for recognized sequence motifs in 
experimentally determined T-cell epitopes or alternatively using computational 
techniques to predict MHC class Il-binding peptides and in particular DR-binding 
peptides. 

20 W098/52976 and WOOO/343 17 teach computational tiireading approaches to identifying 
polypeptide sequences with the potential to bind a sub-set of human MHC class n DR 
allotypes. In these teachings, predicted T-cell epitopes are removed by the use of 
judicious amino add substitution within the primary sequence of the therapeutic antibody 
or non-antibody protein of both non-human and human derivation. 

25 

Other techniques exploiting soluble complexes of recombinant MHC molecules in 
combination with synthetic peptides and able to bind to T-cell clones from peripheral 
blood samples from human or experimental animal subjects have been used in the art 
[Kern, F. et al (1998) Nature Medicine 4:975-978; Kwok. W.W. et al (2001) TRENDS in 
30 Immunol 22:583-588]. These and other schemes including for example the use of whole 
proteins or synthetic peptides or variant molecules to the protein of interest may be 
screened for molecules with altered ability to bind or stimulate T-cells may equally be 
exploited in an epitope identification strategy. 
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As depicted above and as consequence thereof, it would be desirable to identify and to 
remove or at least to reduce T-cell epitopes from a given in principal therapeutically 
valuable but originally immunogenic peptide, polypeptide or protein. 

5 One of these therapeutically valuable molecules is bryodin 1, The present invention 
provides for modified forms of bryodin 1 with one or more T cell epitopes removed. The 
sequence of bryodin 1 protein as given by Gawlak et al [Gawlak, S. et al (1997) 
Biochemistry 36:3095-3103] is depicted in single-letter code as follows: 

DVSFRLSGATTTSYGWIKNLREALPYERKVYlSrrPLLRSSISGSGRYTLLHLT^ 
10 VTNVYIMGYLAGDVSYFFNEASATEAAKFWKDT^KK^ 

ALDSAITTLYYYTASSiUVSALLVLIQSTAESARYKFIEQQIGKRVDKTFLPSLATISriE^^ 

SKQIQIASTNNGQFESPVVLIDGISINQRVSITNASAR^^ 

EHGLYGI 

15 The bryodin 1 protein is single polypeptide of 267 amino acids with a molecular weight 
of approximately 29,000 Da. Bryodin 1 is a type 1 ribosome inactivating protein (RIP) 
originally isolated from the roots of the plant Bryonia dionica [US,5541110]. There is 
considerable interest in this and other RIPs on account of their toxicity to living cells. In 
particular recombinant forms in fusion with cell-specific targeting domains (e.g, 

20 antibodies) have potential value in many therapeutic areas where the selective killing of 
particular cell populations is a desired outcome. 

It is a particular objective of the present invention to provide modified bryodin 1 proteins 
in which the immune characteristic is modified by means of reduced numbers of potential 
T-cell epitopes. 

25 

Others have provided bryodin molecules and in particular recombinant bryodin 1 
[US ,5541 110; US,5932447], but these teachings do not recognise the importance of T cell 
epitopes to the immunogenic properties of the protein nor have been conceived to directly 
influence said properties in a specific and controlled way according to the scheme of the 
30 present invention. By contrast, the PCT patent application WOOO/34317 published 15- 
June 2000 discloses a modified bryodin 1 molecule including substitutions at positions 
5,6, 18, 27, 111, 164, 216, 222, 237 and 249. The substitutions have been selected on the 
basis of an in silica motif matching tool and do not address the most biologically relevant 
MHC class n epitopes detected in a biological assay and which are for the first time 
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disclosed herein. Moreover where the present invention discloses sequences to be 
considered as the biologically relevant epitopes in the subject molecule the inventors have 
recognized largely identical sequences in related proteins namely a-trichosanthin, a- 
momorcharin and 6-momorcharin which accordingly by structural homology are relevant 
5 epitopes also in these proteins. 

There is a continued need for bryodin 1 analogues with enhanced properties. Desired 
enhancements include alternative schemes and modalities for the expression and 
purification of the said therapeutic, but also and especially, improvements in the 
10 biological properties of the protein. There is a particular need for enhancement of the in 
vivo characteristics when administered to the human subject. In this regard, it is highly 
desired to provide bryodin 1 with reduced or absent potential to induce an immune 
response in the human subject. 

15 SUMMARY AND DESCRIPTION OF THE INVENTION 

The present invention proyides for modified forms of bryodin 1, in which the immune 
characteristic is modified by means of reduced numbers of potential T-cell epitopes. 

20 The invention discloses sequences identified within the bryodin 1 primary sequence that 
are potential T-cell epitopes by virtue of MHC class n binding potential- This disclosure 
specifically pertains the bryodin 1 protein which inclusive of an N-terminal pro-peptide 
comprises 267 amino acid residues. 

25 The present invention discloses the major regions of the bryodin 1 primary sequence that 
are immunogenic in man and thereby provide the critical information required to conduct 
modification of the sequence to eliminate or reduce the inmiunogenic effectiveness of 
these sites. 

30 In one embodiment, synthetic peptides comprising the said inmaunogenic regions can be 
provided in a pharmaceutical composition for the purpose of promoting a tolerogenic 
response to tiie whole molecule. 
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In a further embodiment bryodin 1 molecules modified within the epitope regions herein 
disclosed can be used in pharmaceutical compositions. 
In sunmiary ttie invention relates to the following issues: 

• using a panel of synthetic peptides in a naive T-cell assay to map the immunogenic 
5 regions of bryodin 1; 

• bryodin 1 derived peptide sequences found to evoke a stimulation index of greater 
than around 2 in a naive T-cell assay; 

• a molecule comprising a modified version of the bryodin 1 amino acid sequence and 
able to evoke a stimulation index of less than the value evoked by a wild-type bryodin 

10 1 amino acid sequence in a T-cell proliferation assay using cells from a donor 
responsive to bryodin 1; 

• a modified molecule having the biological activity of bryodin 1 and being 
substantially non-inomunogenic or less immunogenic than any non-modified molecule 
having the same biological activity when used in vivo; 

15 • an accordingly specified molectde, wherein said loss of immunogenicity is achieved 
by removing one or more T-cell epitopes derived from the originally non-modified 
molecule; 

• an accordingly specified molecule, wherein said loss of immunogenicity is achieved 
by reduction in numbers of MHC allotypes able to bind peptides derived from said 

20 molecule; 

• an accordingly specified molecule, wherein said originally present T-cell epitopes are 
MHC class n ligands or peptide sequences which show the ability to stimulate or bind 
T-cells via presentation on class 11; 

• an accordingly specified molecule, wherein said peptide sequences are selected from 
25 the group as dqpicted in FIGURE 1 ; 

• an accordingly specified molecule, wherein 1-9 amino acid residues, preferably one 
amino acid residue in any of the originally present T-cell epitopes are altered; 

• an accordingly specified molecule, wherein the alteration of the amino acid residues is 
substitution, addition or deletion of originally present amino acid(s) residue(s) by 

30 other amino acid residue(s) at specific position(s); 

• an accordingly specified molecule, wherein, if necessary, additionally further 
alteration usually by substitution, addition or deletion of specific amino acid(s) is 
conducted to restore biological activity of said molecule; 
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• a peptide molecule of above sharing greater than 90% amino acid identity with any of 
the peptide sequences of FIGURE 1; 

• a peptide molecule of above sharing greater than 80% atnino acid identity with any of 
the peptide sequences of FIGURE 1; 

S • peptide sequences as above able to bind MHC class U; 

• an accordingly specified bryodin 1 molecule, wherein one or more of the amino acid 
substitutions is conducted at a position corresponding to any of the amino acids 
specified within HGURE 1 ; 

• an accordingly specified molecule wherein alteration is conducted at one or more 

10 residues from any or all of the string of contiguous residues of sequences (a), (b), (c), 
(d), or (e) as below wherein said sequences are derived from the bryodin 1 wild-type 
sequence where using single letter code; 

(a) = RYTLLHLTNYADETISVAVDV (Rl), 

(b) = ATEAAKFVFKDAEOCK 

15 (c) = ERLQTAAGKIRENIPLGLPALDSA (R3), 

(d) = ITTLYYYTASSAASALLVLIQSTAESA (R4), 

(e) = ATISLENNWSALSKQIQIAST (R5), 

• a peptide molecule comprising 13-15 consecutive residues from any of sequences (a), 
(b), (c), (d) or (e) above; 

20 • a peptide molecule comprising at least 9 consecutive residues from any of the 
sequences (a), (b), (c) (d) or (e) above; 

• a peptide molecule of above sharing greater than 90% amino acid identity with any of 
the peptide sequences derived from (a), (b), (d) or (e) above; 

• a peptide molecule of above sharing greater than 80% amino acid identity with any of 
25 the peptide sequences derived from (a), (b), (c) (d) or (e) above; 

• peptide sequences as above able to bind MHC class IE; 

• an accordingly specified bryodin 1 molecule, wherein one or more of the amino acid 
substitutions is conducted at a position corresponding to any of the amino acids 
specified within any of sequences (a), (b), (c), (d) or (e) above; 

30 • an accordingly specified bryodin 1 molecule, wherein one or more of the amino acid 
substitutions is conducted at a position corresponding to any of the amino acids 
specified within sequence (a) and or (e) above; 
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• an accordingly specified bryodin 1 molecule, wherein one or more of the amino acid 
substitutions is conducted at a position corresponding to any of the amino acids 
specified within sequence (a) and or (e) and additional substitutions made within 
sequence (c) and or (d) above; 

5 • a peptide sequence consisting of at least 9 consecutive amino acid residues of any of 
the sequences (a), (b), (c), (d) or (e) as specified above and its use for the manufacture 
of bryodinl, a-trichosanthin, a-momorcharin or fi-momorcharin having substantially 
no or less immunogenicity than any non-modified molecule and having the biological 
activity of a t3^e 1 RIP when used in vivo\ 

10 • a pharmaceutical composition comprising any of the peptides or modified peptides of 
above having the activity of binding to MHC class 11; 

• a DNA sequence or molecule which codes for any of said specified modified 
molecules as defined above and below; 

• a pharmaceutical composition comprising a modified molecule having the biological 
15 activity of bryodin 1 ; 

a pharmaceutical composition as defined above and / or in the claims, optionally 
together with a pharmaceutically acceptable carrier, diluent or excipient; 

• a method for manufacturing a modified molecule having the biological activity of 
bryodin 1 as defined in any of the claims comprising the following steps: (i) 

20 determining the amino acid sequence of the polypeptide or part thereof; (ii) identifying 
one or more potential T-cell epitopes within the amino acid sequence of the protein by 
any method including determination of the binding of the peptides to MHC molecules 
using in vitro or in silica techniques or biological assays; (iii) designing new sequence 
variants with one or more amino acids within the identified potential T-cell epitopes 

25 modified in such a way to substantially reduce or eliminate the activity of the T-cell 
epitope as determii^ied by the binding of the peptides to MHC molecules using in vitro 
or in silico techniques or biological assays; (iv) constructing such sequence variants by 
recombinant DNA techniques and testing said variants in order to identify one or more 
variants with desirable properties; and (v) optionally repeating steps (ii) - (iv); 

30 • an accordingly specified method, wherein step (iii) is carried out by substitution, 

addition or deletion of 1 - 9 amino acid residues in any of the originally present T-cell 
epitopes; 
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• an accordingly specified method, wherein the alteration is made with reference to an 
homologous protein sequence and / or in silica modeling techniques; 

• an accordingly specified method, wherein step (ii) of above is carried out by the 
following steps: (a) selecting a region of the peptide having a known amino acid 

5 residue sequence; (b) sequentially sampling overlapping amino acid residue segments 
of predetermined uniform size and constituted by at least three anoino acid residues 
from the selected region; (c) calculating MHC Class n molecule binding score for each 
said sampled segment by sununing assigned values for each hydrophobic amino acid 
residue side chain present in said sampled amino acid residue segment; and (d) 

10 identifying at least one of said segments suitable for modification, based on the 

calculated MHC Class EE molecule binding score for that segment, to change overall 
MHC Class n binding score for the peptide without substantially reducing therapeutic 
utility of the peptide; step (c) is preferably carried out by using a B6hm scoring 
function modified to include 12-6 van der Waal's ligand-protein energy repulsive term 

15 and ligand conformational energy term by (1) providing a first data base of MHC Class 
n molecule models; (2) providing a second data base of allowed peptide backbones for 
said MHC Class n molecule models; (3) selecting a model from said first data base; 
(4) selecting an allowed peptide backbone fix>m said second data base; (3) identifying 
amino acid residue side chains present in each sampled segment; (6) detemfiining the 

20 binding affinity value for all side chains present in each sampled segment; and 
repeating steps (1) through (5) for each said model and each said backbone; 

• a 13mer T-cell epitope peptide having a potential MHC class n binding activity and 
created from non-modified bryodin 1, selected from the group as depicted in FIGURE 
1 and its use for the manufacture of bryodin 1 having substantially no or less 

25 immunogenicity than any non-modified molecule with the same biological activity 
when used in vivo; 

• a peptide sequence consisting of at least 9 consecutive amino acid residues of a 13mer 
T-ceU epitope peptide as derived from any of the sequences in FIGURE 1 and its use 
for the manufacture of bryodin 1 having substantially no or less inmiunogenicity than 

30 any non-modified molecule and having the biological activity of a bryodin molecule 
when used in vivo. 
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• a bryodin 1 molecule of structure according to Formula I: 

X^DVSFRLSGATTTSYGWIKNLI^ALPYERKVYNIPLLRSSISGSGRyX^X^LX^LTX^X^^ 
TX^SVAX''DX^TlSrVYIMGYLAGDVSYFFNEASATEAAiaC^X^°FKDAK^^ 

tx^ax^^x^^x^'x^'ei^^^plgx^pax^^dsax^^ttx^yx^x^^tassaasax^x'^x^^^^ 

5 AESARYKFIEQQIGKRVDKTFLPSIATX^'SX^^EiraWSAX^^SX^^QX^^QX^^ 
VlilDGlSnSrQRVSITNASARVVTSNIALLLNRiSINIAAIGEDISM 

wherein 

X® is hydrogen or a targeting moiety such as an antibody domain; 
is most preferably A but G and P aie also considered; 
10 X^ is most preferably M but A, G, P and I are also considered; 
X^ is most preferably A but G and P are also considered; 
X* is most preferably P but Y is also considered; 
X^ is most preferably T but S is also considered; 
X^ is P; 

15 X^ is most preferably A but P and G are also considered; 
X^ is most preferably A but P and G are also considered; 

X^ is most preferably A but P, G, H, D, E, N, Q, K, R, S and T are also considered; 
X^° is most preferably A but P and G are also considered; 
X^^ is most preferably A but P and G are also considered; 
20 X^^ is most preferably A but P, S, T. H and K are also considered; 
X^^isT; 
X^^isH; 
X^^ is S; 

X^^ is most preferably A, but S, T. P, N, D, E, G, H, K and Q are also considered; 
25 X^'^isT; 

X^^ is most preferably A but P is also considered; 

X^^ is most preferably A but I, F, G, M, P, V, W and Y are also considered; 
X^° is most preferably F but P and W are also considered; 
X^^ is most preferably A but P and G are also considered; 
30 yP' is most preferably G but A and P are also considered; 
X^^ is most preferably G but A and P are also considered; 
X^"* is most preferably A but P and G are also considered; 
X^ is most preferably A but P, G, S and T are also considered; 
X^^ is most preferably A but I, M, S, T, P and G are also considered; 
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X^^ is most preferably A but G and P are also considered; 

X^^ is most preferably S but A, G, P, T, H, D, N, Q, K and R are also condidered; 
X^^ is most preferably T but A, G, S, P, H, K, R, D, E, N and Q are also considered; 
X^^ is most preferably A but G, S, T, P, K, R, H, D, E, N and Q are also considered; 
5 X^MsQ; 

X^^ is most preferably H but D, E, F, L, N, P, S, W and Y are also considered; 
X^^ is most preferably T but A, G, P, D, E, H, K, R, N, Q, S and T are also considered; 
X^ is most preferably D, 
and whereby simultaneously 
10 X*=T,X2 = L,X^ = H,X^ = N.X' = Y,X^=:I,X^ = V.X« = V,X^ = F,X^^ = V,X" = V. ' 
X^^ = L,X^^ = A,X^^ = G,X^^ = K,X^*^ = I,X" = R.X^' = I.X^^ = L,X^ = L,X2^ = I,^^ 
X^ = Y,X^ = Y,X^ = L,X2^ = L,X^^ = V,X^« = L,X2^=I,X^ = L.X'' = L,X^^ = K, 
X^^ = IandX^ = I 
are excluded. 

15 

The term "T-cell epitope" means according to the understanding of this invention an 
amino acid sequence which is able to bind MHC class n, able to stimulate T-cells and / or 
also to bind (without necessarily measurably activating) T-cells in complex with MHC 
class n. 

20 The term "peptide" as used herein and in the appended claims, is a compound that 

includes two or more amino acids. The amino acids are linked together by a peptide bond 
(defined herein below). There are 20 different naturally occurring amino acids involved 
in the biological production of peptides, and any number of them may be linked in any 
Older to form a peptide chain or ring. The naturally occurring amino acids employed in 

25 the biological production of peptides all have the L-configuration. Synthetic peptides can 
be prepared employing conventional synthetic methods, utihzing L-amino acids, D-amino 
acids, or various combinations of amino acids of the two different configurations. Some 
peptides contain only a few amino acid units. Short peptides, e.g., having less than ten 
amino acid units, are sometimes referred to as "oligopeptides". Other peptides contain a 

30 large number of amino acid residues, e.g. up to 100 or more, and are referred to as 

"polypeptides". By convention, a "polypeptide" may be considered as any peptide chain 
containing three or more amino acids, whereas a "oligopeptide" is usually considered as a 
particular type of "short" polypeptide. Thus, as ixsed herein, it is understood that any 
reference to a "polypeptide" also includes an oligopeptide. Further, any reference to a 
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"peptide" includes polypeptides, oligopeptides, and proteins. Each different arrangement 
of amino acids forms different polypeptides or proteins. The number of polypeptides-and 
hence the number of different proteins-that can be formed is practically unlimited. 
"Alpha carbon (Ca)" is the carbon atom of the carbon-hydrogen (CH) component that is 
5 in the peptide chain. A "side chain" is a pendant group to Ca that can comprise a simple 
or complex group or moiety, having physical dimensions that can vary significantiy 
compared to the dimensions of the peptide. 

The invention may be applied to any bryodin 1 species of molecule with substantially the 
same primary anaino acid sequences as those disclosed herein and would include therefore 
10 bryodin 1 molecules derived by genetic engineering means or other processes and may 
contain more or less than 267 amino acid residues. 

The invention is conceived to overcome the practical reality that soluble proteins 
introduced with therapeutic intent in man trigger an inmiune response resulting in 
15 development of host antibodies that bind to the soluble protein. The present invention 
seeks to address this by providing bryodin X proteins with altered propensity to elicit an 
immune response on administration to the human host. According to the methods 
described herein, the inventors have discovered the regions of the bryodin 1 molecule 
comprising the aitical T-cell epitopes driving the immune responses to this protein. 

20 

The general method of the present invention leading to the modified bryodin 1 comprises 
the following steps: 

(a) determining the amino acid sequence of the polypeptide or part thereof; 

(b) identifying one or more potential T-cell epitopes within the amino acid sequence of 
25 the protein by any method including determination of the binding of the peptides to MHC 

molecules using in vitro or in silico techniques or biological assays; 

(c) designing new sequence variants with one or more amino acids within the identified 
potential T-cell epitopes modified in such a way to substantially reduce or elinodnate the 
activity of tiie T-cell epitope as determined by the binding of the peptides to MHC 

30 molecules using in vitro or in silico techniques or biological assays. Such sequence 
variants are created in such a way to avoid creation of new potential T-cell epitopes by 
the sequence variations unless such new potential T-ceU epitopes are, in turn, modified in 
such a way to substantially reduce or elinoinate the activity of the T-cell epitope; and 
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(d) constructing such sequence variants by recombinant DNA techniques and testing said 
variants in order to identify one or more variants with desirable properties according to 
well known recombinant techniques. 

5 The identification of potential T-cell epitopes according to step (b) can be carried out 
according to methods described previously in the art. Suitable methods are disclosed in 
WO 98/59244; WO 98/52976; WO 00/34317; WO 02/069232 and may be used to 
identify binding propensity of bryodin Iderived peptides to an MHC class n molecule. In 
practice, the compositions embodied in the present invention have been derived with the 

10 concerted application of biological ex vivo human T-cell proliferation assays and a 
software tool exploiting the scheme outlined in WO 02/069232 and which is. an 
embodiment of the present invention. 

The software simulates the process of antigen presentation at the level of the peptide 
15 MHC class n binding interaction to provide a binding score for any given peptide 
sequence. Such a score is determined for many of the predominant MHC class n 
allotypes extant in the population. As this scheme is able to test any peptide sequence, 
the consequences of amino acid substitutions additions or deletions with respect to the 
ability of a peptide to interact with a MHC class n binding groove can be predicted. 
20 Consequently new sequence compositions can be designed which contain reduced 
numbers of peptides able to interact with the MHC class H and thereby function as 
iimnmunogenic T-cell epitopes. Where the biological assay using any one given donor 
sample can assess binding to a maximum of 4 DR allotypes, the in silico process can test 
the same peptide sequence using >40 allotypes simultaneously. In practice this approach 
25 is able to direct the design of new sequence variants which are compromised in the their 
ability to interact with multiple MHC allotypes. 

By way of an example of this in silico approach, the results of an analysis conducted on 
the entire bryodin Isequence is provided as FIGURE 1. Therein are listed 13mer peptide 
30 sequences derived from bryodin I detected to have the capability to bind one or more 
MHC class n allotypes with a significant binding score. Taken in its entirety, this dataset 
of 13mer peptides is considered to provide with a high degree of certainty, the universe of 
permissible MHC class ligands for the bryodin 1 protein. For reasons such as the 
requirement for proteolytic processing of the complete bryodin 1 polypeptide and other 
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physiologic steps leading to the presentation of bryodin 1 peptides in vivo, it would be 
clear that a relatively minor sub-set of the entire repertoire of peptides will have ultimate 
biological relevance. In order to further identify such biologically relevant peptides, the 
inventors have developed an approach exploiting ex vivo human T-cell proliferation 
5 assays. 

This approach has proven to be a particularly effective method and is disclosed herein as 
an embodiment of the invention. The method can be applied to test part of the sequence, 
for example a sub-set of bryodin 1 peptides such as all or some of those listed in FIGURE 

10 1; or the method may be applied to test entire sequence. In the present studies, the 

method has involved the testing of overlapping bryodin 1 derived peptide sequences in a 
scheme so as to scan and test the entire bryodin 1 sequence (including peptides 
representing the N-terminal pro-pepeptide). The synthetic peptides are tested for their 
ability to evoke a proliferative response in human T-cell cultured in vitro. Where this 

15 type of approach is conducted using naive human T-cells taken from healthy donors, the 
inventors have established that in the operation of such an assay, a stimulation index 
equal to or greater than 2.0 is a useful measure of induced proliferation. The stimulation 
index is conventionally derived by division of the proliferation score (e.g. counts per 
minute of radioactivity if using ^-thymidine incorporation) measured to the test peptide 

20 by the score measured in cells not contacted with a test peptide. 

The present studies have uncovered some 32 peptide sequences able to evoke a significant 
proliferative response (i.e. SI>2.0) in T-cells derived from at least one donor. Within this 
set of peptides, a further sub-set of peptides have been identified which evoke a 
25 significant proliferative response in 2 or more individual donor samples and for some of 
theses responses the magnitude of response has indeed been significantiy higher than 
SI=2.0. 

It is most preferred to provide a bryodin 1 molecule in which amino acid modification 
30 (e.g. a substitution) is conducted within the most immunogenic regions of the parent 
molecule. The inventors herein have discovered that tihe most immunogenic regions of 
the bryodin 1 molecule in man are confined to at least five regions Rl - R5 encompassing 
. residues 46-66; 88-102; 112-135; 136-162 andl78-204 comprising respectively amino 
acid sequences; 
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Rl) RYTLLHLTNYADETISVAVDV; 

R2) ateaakfvfkdakkk; 

R3) erlqtaagkireniplglpaldsa; 

R4) ITTLYYYTASSAASAIiLVLIQSTAESA and 

5 R5) ATISLENNWSALSKQIQIAST. 

These regions have been identified on the basis of giving SI > 2 in one or more donor 
PBMC samples. For example epitope region Rl was proven to be reactive in 6 different 
donor samples representing over 28% of the donor samples screened. Similarly the R2 
and R3 epitopes were reactive with 3 (14%) of donor samples tested, R4 with 5 (24%) of 

10 donor samples and R5 with 4 (19%) of donors tested. Taken together regions Rl - R5 
were reactive with 10 of the 21 (48%) donor PBMC samples tested covering a wide range 
of allotypic specificities. 

The major preferred embodiments of the present invention comprise bryodin 1 molecules 
15 for which the MHC class n ligands identified within any of the epitopes Rl - R5 are 

altered such as to eliminate binding or otherwise reduce the numbers of MHC allotypes to 
which the peptide can bind. 

Where multiple potential epitopes are identified and in particular where a number of 
20 peptide sequences are found to be able to stimulate T-cells in a biological assay, 

cognisance may also be made of the structural features of the protein in relation to its 
propensity to evoke an immune response via the MHC class II presentation pathway. For 
example where the crystal structure of the protein of interest is known the 
crystallographic B-factor score may be analysed for evidence of structural disorder within 
25 the protein, a parameter suggested to correlate with the proximity to the biologically 

relevant immunodominant peptide epitopes [Dai G. et al (2001) J. Biological Chem, 276: 
41913-41920]. Such an analysis when conducted on the bryodin 1 crystal stracture model 
[PDB ID: IBRY, Gawlak, S. L„ et al (1997) Biochemistry 36: 3095] suggests a high 
likelihood for multiple immunodominant epitopes with at least 4 discrete zones mapping 
30 to the medial position of areas with above average B-factor scores. Of the these 4 areas, 
3 mapped to the N-terminal boundary of peptides shown to evoke a proliferative response 
in the nai've T-cell assay of EXAMPLE 2. 
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This data taken together with the data for the numbers of naiVe donors responding to 
particular peptides enables a predicted ranking of the most inmiimodominant regions of 
the molecule. It is however recognised that in practice, each of these regions are 
considered immunogenic in man and therefore require modification under the scheme of 

5 the invention. Accordingly, with reference to the above defined sequence strings Rl — 
R5, sequences may be ranked in the order {Rl, R5}, {R3, R4}, R2; where {Rl, R5} are 
considered the most immunogenic sequences and R2 relatively less immunogenic. Equal 
ranking is ascribed to those sequences in brackets. On this basis the most preferred 
bryodin 1 compositions under the scheme of the present involve modifications within 

10 epitope regions Rl and R5. Compositions containing in addition modifications within 
epitope regions R3 and R4 are also desired and optionally also additional substitutions 
within epitope region R2. 

The disclosed peptide sequences herein represent the critical information required for the 
15 constmction of modified bryodin 1 molecules in which one or more of these epitopes is 
compromised. Under the scheme of the present, the epitopes are compromised by 
mutation to result in sequences no longer able to function as T-cell epitopes. It is possible 
to use recombinant DNA methods to achieve directed mutagenesis of the target sequences 
and many such techniques are available and well known in the art. In practice a number 
20 of variant bryodin proteins will be produced and tested for the desired immune and 
fimctional characteristic. 

Where it is the objective of this invention to modify the amino acid sequences of at least 
one or more of the above listed peptides from FIGURE 1, it is most preferred to modify 
25 the sequence of one or more of the epitope regions Rl - R5 identified above. There are 
herein disclosed suitable modifications which achieve the objective of reducing or 
eliminating the capabilities of the subject peptide sequence to function as a T-cell epitope 
and which may result in a bryodin 1 molecule with a reduced immunogenic potential 
when administered as a therapeutic to the human host. 

30 

For the elimination of T-cell epitopes, amino acid substitutions are preferably made at 
appropriate points within the peptide sequence predicted to achieve substantial reduction 
or elimination of the activity of the T-cell epitope. In practice an appropriate point will 
preferably equate to an amino acid residue binding within one of the pockets provided 
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within the MHC class n binding groove. It is most preferred to alter binding within the 
first pocket of the cleft at the so-called PI or PI anchor position of the peptide. The 
quality of binding interaction between the PI anchor residue of the peptide and the first 
pocket of the MHC class II binding groove is recognised as being a major determinant of 

5 overall binding affinity for the whole peptide. An appropriate substitution at this position 
of the peptide will be for a residue less readily acconmaodated within the pocket, for 
example, substitution to a more hydrophilic residue. Combinations of substitation within 
a single epitope may be contemplated and for example can be particularly appropriate 
where individually defined epitopes are in overlap with each other. Moreover, amino acid 

10 substitutions either singly within a given epitope or in combination within a single epitope 
may be made at positions not equating to the "pocket residues" with respect to the MHC 
class n binding groove, but at any point within the peptide sequence. Substitutions may 
be made with reference to an homologous structure or structural method produced using 
in silico techniques known in the art and may be based on known structural features of the 

15 molecule according to this invention. All such substitutions fall within tiie scope of the 
present invention. 

Amino acid substitutions other than within the peptides identified above may be 
contemplated particularly when made in combination with substitution(s) made within a 
20 listed peptide. For example a change may be contemplated to restore structure or 

biological activity of the variant molecule. Such compensatory changes and changes to 
include deletion or addition of particular amino acid residues from the bryodin 1 
polypeptide resulting in a variant with desired activity and in combination with changes in 
any of the disclosed peptides fall under the scope of the present. 

25 

One example of such a set of preferred modifications is provided by the disruption of the 
Rl epitope region. Complete elimination of all possible MHC ligands within this region 
is achieved by the a substitution set comprising the changes; T49A, L50M, HsaA, N55P, 
Y56T, l6iP, V65 A and V67A. Such preferred changes either in isolation or in combination 
30 are an embodiment of the invention 

Similarly, a preferred set of modifications achieving the disruption of the R2 epitope is 
provided by the substitution set F99A, ViooA and Vios A- Such preferred changes either in 
isolation or in combination are an embodiment of the invention. 
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For epitope region R3 alternative substitution sets are defined based on knowledge of the 
key structural features of the molecule. It would be highly desired to construct a modified 
bryodin 1 molecule containing substitution at leucine residue 115 (L115), as this residue 
can function as a PI anchor for one MHC class n ligand identified within the R3 epitope. 
. 5 A preferred set of substitutions would accordingly comprise L115A, Ii22A, IiaaA L130A, 
L133F and I137 A. However as Li 15 is located to the floor of the binding cleft for RIP 
activity this substitution may compromise the functional activity of the molecule. An 
altemative set of substitutions can be defined which serve to disrupt the significant MHC 
ligands within the R3 epitope and yet which maintain Lns- Accordingly these 
10 substitutions comprise AngT, G120H, K121S and R123T and would be made in altemative to 
the dual changes LnsA and I122A. All changes are an embodiment of the invention. 

For epitope region R4, a preferred substitution set comprises the changes Li4oG> Y142G, 
YusA in combination with L152A, L153A, V154A and L155S. All changes either in isolation 
15 or in combination are embodiment of the invention, 

A yet further example of a set of preferred modifications is provided by the dismption of 
the R5 epitope region using the changes comprising I187T, L189A, L196Q, K197H, I200T and 
I202D- All changes either in isolation or in combination are embodiment of the invention, 

20 

For nearly all of the above preferred substitutions altemative amino acids may be 
considered at any given position. The choices of altemative residue are however not un- 
limited and are confined to residues satisfying the broad objectives of reducing or 
eliminating the potential MHC peptide interaction and also being accommodated within 
25 the stmcture of the molecule; i.e. significant side chain clashes are avoided for most 

rotamers and or electrostatic or other contacts are either preserved or made. Examples of 
altemative residue choices which may be considered are provided in the bryodin 1 
structure as depicted in the FORMULA 1. 

30 From the foregoing it can be seen that according to this invention a number of variant 
bryodin 1 proteins can be produced and tested for the desired immune and functional 
characteristic and all such functional proteins are embodiments of the present invention. 
Moreover the modifications conducted have been demonstrated to result in peptide 
sequences not able to bind MHC class n molecules with the same affinity as the parental 
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or "wild-type" (wt) peptide sequence using the predictive in silico MHC class n binding 
toolofW002/069232. 

The preferred molecules of this invention can be prepared in any of several ways but is 
5 most preferably conducted exploiting routine recombinant methods. It is a relatively 
facile procedure to use the protein sequences and information provided herein to deduce a 
polynucleotide (DNA) encoding any of the preferred protein sequences. This can be 
achieved for example using computer software tools such as the DNSstar software suite 
[DNAstar Lie, Madison, WI, USA] or similar. Any such DNA sequence with the 
10 capability of encoding the preferred polypeptides of the present or significant homologues 
thereof, should be considered as embodiments of this invention. 

As a general scheme, genes encoding any of the preferred bryodin 1 protein sequences 
can be made using gene synthesis and cloned into a suitable expression vector. In tum the 

15 expression vector is introduced into a host cell and cells selected and cultured. The 
preferred molecides are purified from the culture medium and formulated into a 
preparation for therapeutic administration. Alternatively, a wild-type bryodin 1 gene 
sequence can be obtained for example following a cDNA cloning strategy using RNA 
prepared from the root tissues of the Bryonia plant. The wild-type gene can be used as a 

20 template for mutagenesis and constmction preferred variant sequences, hi this regard it is 
particularly convenient to use the stmtegy of "overlap extension PCR" as described by 
Higuchi et al [Higuchi et al (1988) Nucleic Acids Res. 16: 7351] although other 
methodologies and systems could be readily applied. 

25 Constitution of the preferred bryodin 1 molecule may be achieved by recombinant DNA 
techniques and this includes bryodin 1 molecules fused with desired anti-body variable 
region domains or otlier targeting moieties. Methods for purifying and manipulating 
recombinant proteins including fusion proteins are well known in the art. Necessary 
techniques are explained fully in the literature, such as, "Molecular Cloning: A 

30 Laboratory Manual", second edition (Sambrook et al., 1989); "Oligonucleotide Synttiesis" 
(M. J. Gait, ed., 1984); "Animal CeU Culture" (R. L Freshney, ed., 1987); "Methods in 
Bcizymology" (Academic Press, Inc.); "Handbook of Experimental Immunology" (D. M. 
Weir & C. C, Blackwell, eds.); "Gene Transfer Vectors for Mammalian Cells" (J. M. 
Miller & M. P. Calos, eds., 1987); "Current Protocols in Molecular Biology" (F. M. 
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Ausubel et al., eds„ 1987); "PGR; The Polymerase Chain Reaction", (Mullis et al., eds., 
1994); "Current Protocols in Immunology" (J. E. Coligan et al., eds., 1991). 

As will be clear to the person skilled in the art, multiple alternative sets of substitutions 
5 could be arrived at which achieve the objective of removing un-desired epitopes. The 
resulting sequences would however be recognised to be closely homologous with the 
specific compositions disclosed herein and therefore fall under the scope of the present 
invention. 

10 In as far as this invention relates to modified bryodin 1, compositions containing such 
modified bryodin proteins or fi-agments of modified bryodinproteins and related 
compositions should be considered within the scope of the invention. In another aspect, 
the present invention relates to nucleic acids encoding modified bryodin entities. In a 
further aspect the present invention relates to methods for therapeutic treatment of 

15 humans using the modified bryodin 1 proteins. In this aspect the modified bryodin 1 
protein may be linked with an antibody molecule or firagment of an antibody molecule. 
The linkage may be by means of a chemical cross-linker or the bryodin 1-antibody may 
be produced as a recombinant fusion protein. The fusion molecule may contain the 
modified bryodin 1 domain with antibody domain orientated towards the N-tenninus of 

20 the fusion molecule although the opposite orientation may be contemplated. 

Desired antibody specificities for linkage to the modified bryodin 1 molecule of the 
present include those directed towards intemalising antigen determinants. Examples of 
this class of antigen are rare but would include the A33 antigen [Heath, J.K. et al (1997) 

25 Proc. Natl Acad. Sci U.SA. 94: 469-474] and the GA733-1 antigen [US,5,840,854]. The 
carcinoembryonic antigen may also be contemplated for use and may be targeted by any 
of numerous antibodies but may include MEE23 [Chester, K.A. et al (1994) Lancet 343: 
455], A5B7 [WO92/010159], T84.66 [US,5,081,235] MN-14 [Hansen, H.J. et al (1993) 
Cancer 21' 3478-3485], COL-1 [US,5,472,693] and others. Other desired specificities 

30 include antibodies directed to non-intemalising antigens and this may include antigens 
such as the 40kDa glycoprotein antigen as recognised by antibody KSl/4 [Spearman et al 
(1987) /. Pharmacol Exp. Therapeutics 241: 695-703] and other antibodies. Other 
antigens such as the epidermal growth factor receptor (HERl) or related receptors such as 
HER2 may be selected including anti-GD2 antibodies such as antibody 14.18 ... 
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[US,4,675,287; EP 0 192 657], or antibodies to the prostate specific membrane antigen 
[US,6,107,090], the IL-2 receptor [US,6,013,256], the Lewis Y determinant, mucin 
glycoproteins or others may be contemplated. 

5 In all instances where a modified bryodin 1 protein is made in fusion with an antibody 
sequence it is most desired to use antibody sequences in which T cell epitopes or 
sequences able to bind MHC class n molecules or stimulate T cells or bind to T cells in 
association with MHC class n molecules have been removed. 



10 A further embodiment of the present invention, the modified bryodin 1 protein may be 
Unked to a non-antibody protein yet a protein able to direct a specific binding interaction 
to a particular target cell. Such protein moieties include a variety of polypeptide ligands 
for which there are specific cell surface receptors and include therefore numerous 
cytokines, peptide and polypeptide hormones and other biological response modifiers. 

15 Prominent examples include such proteins as vascular epithelial growth factor, epidermal 
growth factor, heregulin, the interleukins, interferons, tmnour necrosis factor and other 
protein and glycoprotein molecules. Fusion proteins of these and other molecules with 
bryodin 1 of the present invention may be contemplated and may comprise the modified 
bryodin 1 moiety in either the N-temainal or C-temiinal orientation with respect to the 

20 protein ligand domain. Equally, chemical cross-linking of the purified ligand to the 
modified bryodin 1 protein may be contemplated and within the scope of the present 
invention. 

In a further embodiment the modified bryodin 1 protein of the present may be used as a 
25 complex containing a water soluble polymer such as hydroxypropylmethacrylamide or 
other polymers where the modified bryodin 1 protein is in covalent attachment to the 
polymer or in a non-covalent binding interaction with the polymer. Such an embodiment 
may additionally include an antigen binding domain such as an antibody or a firagment of 
an antibody in combination with the polymer bryodin 1 complex. 



Jn a further aspect still, the mvention relates to methods for therapeutic treatment using 
pharmaceutical preparations comprising peptide or derivative molecules with sequence 
identity or part identity with the sequences herein disclosed. 
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In a yet further aspect, the major immimogenic epitopes herein disclosed and relating to 
the bryodin 1 molecule are also shown herein to be present within the primary sequence 
of a number of other type 1 RIP proteins of which bryodin 1 is an example. Thus the 
proteins a-trichosanthin (ITCS), a-momorchaiin (IMO^fQ and 6-momorcharin (1CF5) 

5 and others noay be shown by protein sequence analysis to contain sequence elements with 
identity or near identity to the inomunogenic regions of the bryodin 1 molecule. FIGURE 
3 depicts sequence comparisons between bryodin 1 major epitopes and sequence elements 
from ITCS, IMOM and 1CF5 proteins. The present invention in so far as it relates to 
peptides and modified sequences derived from the bryodin 1 protein, where the identical 

10 or substantially similar sequences are identified within other proteins, these are 

considered equally to fall under the scope of the present. This is particularly true for 
some of the preferred mutation sets identified herein. For example the changes within R2 
and R3 implemented in the bryodin 1 sequence may be appUed for the removal of MHC 
class n Ugands from the equivalent regions within the ITCS sequence. Equally, the Rl 

15 changes in bryodin 1 comprising one or more of the substitutions T49A, L50M, H52A, 
NssP, Y56T, leiP, V65A and V67A can be applied to the equivalent regions with the 
proteins ITCS and ICFS. In the foregoing, numbering is according to the bryodin 1 
sequence. A proportion of the preferred R4 and R5 changes may also be implemented 
within the REP proteins ITCS, 1CF5 and IMOM and equally fall under the scope of the 

20 present invention. 

In as far as this invention relates to modified bryodin 1, compositions containing such 
modified bryodin 1 proteins or fragments of modified bryodin 1 proteins and related 
compositions should be considered within the scope of the invention. A pertinent 

25 example in this respect could be development of peptide mediated tolerance induction 
strategies wherein one or more of the disclosed peptides is administered to a patient with 
immunotherapeutic intent. Accordingly, synthetic peptides molecules, for example one of 
more of those listed in FIGURE 1 or more preferably sequences comprising all or part of 
any of the epitope regions Rl - R5 as defined above. Such peptide are considered 

30 embodiments of the invention. 

In anotfier aspect, the present invention relates to nucleic acids encoding modified 
. bryodin 1 entities. 
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The invention will now be illustrated by the experimental examples below. The invention 
is additionally illustrated by the figures described below: 

FIGURE 1 provides a list of peptide sequences in bryodin 1 with potential 
5 human MHC class n binding activty Peptides are 13-mers, amino acids are 

identified using single letter code 

FIGURE 2 provides a table of the bryodin 1 15-mer peptide sequences analysed 
using the naive hvonan in vitro T-cell assay of EXAMPLE 2. The peptide ID# and 
10 position of the N~terminal peptide residue within the bryodin 1 sequence is 

indicated 

FIGURE 3 indicates the sequence elements Rl, R2, R3, R4 and R5 from the 
bryodin 1 (IBRY) sequence which give a stimulation index of 2.0 or greater in 
15 PBMC preparations from 2 or more donors PBMC using the naiVe human in vitro 

T-cell assay of EXAMPLE 2. Corresponding sequences from related proteins a- 
trichosanthin (ITCS), a-momorcharin (IMOM) and B-momorcharin (1C3F5) are 
shown beneath each bryodin 1 sequence. Sequences are identical to bryodin 1 
except where indicated. Amino acids are deleted using single letter code. 



20 



25 



FIGURE 4 shows the percent of donor responses to individual bryodin 1 peptides. 
The total number of 85 peptides were tested using PBMC preparations from 21 
donor samples. A positive response is taken as an SI > 2, epitope regions are 
identified where positive responses are seen in 2 or more donors. 



FIGURE 5 shows representative stimulation Index (SI) plots from naive himian 
T-cell proliferation assays. Responses are shown for l^M and SjxM 
concentrations of peptide. Each peak is the mean of a triplicate assay. 
Panel A shows PBMC responses from 3 donor samples to bryodin 1 peptides 
30 encompassed within epitope region Rl . 

. Panel B shows PBMC responses from 2 donor samples to bryodin 1 peptides 
encompassed within epitope region R2. 
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Panel C shows PBMC responses from 2 donor samples to bryodin 1 peptides 
encompassed within epitope region R3. 

Panel D shows PBMC responses from 3 donor samples to bryodin 1 peptides 
encompassed within epitope region R5. 

5 

FIGURE 6 is a depiction of the MHC class n ligands identified within epitope 
region Rl. Ligands are identified using the in silico system of EXAMPLE 1. In 
this case the binding profile of 18 human DR allotypes are displayed as columns. 
The ligands detected are 13-mers and residue number 1 of each 13-mer is 
10 identified by a coloured block. The intensity of the binding interaction (High , 

Medium or Low) for each peptide with respect to each of the 18 allotypes is 
indicated according to the key displayed. 

FIGURE 7 is a depiction of the MHC class n ligands identified within epitope 
15 region R2. Ligands are identified using the in silico system of EXAMPLE 1. in 

this case the binding profile of 18 human DR allotypes are displayed as columns. 
The ligands detected are 13-mers and residue number 1 of each 13-mer is 
identified by a coloured block. The intensity of the binding interaction (High , 
Medium or Low) for each peptide with respect to each of the 18 allotypes is 
20 indicated according to the key displayed. 

FIGURE 8 is a depiction of the MHC class n ligands identified within epitope 
region R3. Ligands are identified using the in silico system of EXAMPLE 1. In 
this case the binding profile of 18 human DR allotypes are displayed as columns. 
25 The ligands detected are 13-mers and residue number 1 of each 13-mer is 

identified by a coloured block. The intensity of the binding interaction (High , 
Medium or Low) for each peptide with respect to each of the 18 allotypes is 
indicated according to the key displayed. 

30 FIGURE 9 is a depiction of the MHC class 11 ligands identified within epitope 

. region R4. Ligands are identified using the in sUico system of EXAMPLE 1. In 
this case the binding profile of 18 human DR allotypes are displayed as columns. 
The ligands detected are 13-mers and residue numbCT 1 of each 13-mer is 
identified by a coloured block. The intensity of the binding interaction (High , 
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Medium or Low) for each peptide with respect to each of the 18 allotypes is 
indicated according to the key displayed. 



10 



5 



FIGURE 10 is a depiction of the MHC class n ligands identified within epitope 
region R5. ligands are identified using the in silico systenoi of EXAMPLE 1. In 
this case the binding profile of 18 human DR allotypes are displayed as columns. 
The ligands detected are 13-mers and residue number 1 of each 13-mer is 
identified by a coloured block. The intensity of the binding interaction (High , 
Medium or Low) for each peptide with respect to each of the 18 allotypes is 
indicated according to the key displayed 



FORMULA 1 depicts a most preferred bryodin 1 structure featuring alternative 
substitutions which could be considered for incorporation into a bryodin 1 
molecule with a reduced immunogenic potential. 



15 

EXAMPLE 1 

Method of identifying epitopes in bryodin 1 using an in silico system for coftducting 
peptide MHC binding analyses: 

20 There are a number of factors that play important roles in determining the total structure 
of a protein or polypeptide. First, the peptide bond, i.e., that bond which joins the amino 
acids in the chain together, is a covalent bond. This bond is planar in structure, 
essentially a substituted amide. An "amide" is any of a group of organic compounds 
containing the grouping -CONH-. 

25 The planar peptide bond linking Ca of adjacent amino acids may be represented as 




depicted below: 

Because the 0=C and the C-N atoms lie in a relatively rigid plaiie, free rotation does not 
occur about these axes. Hence, a plane schematically depicted by the intermpted line is 
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sometimes referred to as an "amide" or "peptide plane" plane wherein lie the oxygen (O), 
carbon (C), nitrogen (N), and hydrogen (H) atoms of the peptide backbone. At opposite 
comers of this amide plane are located the Ca atoms. Since there is substantially no 
rotation about the 0=C and C-N atoms in the peptide or amide plane, a polypeptide chain 

5 thus comprises a series of planar peptide linkages joining the Ca atonais. 
A second factor that plays an important role in defining the total structure or 
conformation of a polypeptide or protein is the angle of rotation of each amide plane 
about the common Ca linkage. The terms "angle of rotation" and "torsion angle" are 
hereinafter regarded as equivalent temis. Assuming that the O, C, N, and H atoms remain 

10 in the amide plane (which is usually a valid assumption, although there may be some 
slight deviations from planarity of these atonis for some conformations), these angles of 
rotation define the N and R polypeptide's backbone conformation, i.e., the stmcture as it 
exists between adjacent residues. These two angles are known as <|) and A set of the 
angles ^u^Vu where the subscript i represents a particular residue of a polypeptide chain, 

15 thus effectively defines the polypeptide secondary structure. The conventions used in 
defining the <{), \|f angles, i.e., the reference points at which the anaide planes form a zero 
degree angle, and the definition of which angle is (|), and which angle is for a given 
polypeptide, are defined in the literature. See, e.g„ Ramachandran et al. Adv, Prot Chem. 
23:283-437 (1968), at pages 285-94, which pages are incorporated herein by reference. 

20 

The present method can be applied to any protein, and is based in part upon the discovery 
that in humans the primary Pocket 1 anchor position of MHC Class n molecule binding 
grooves has a well designed specificity for particular annino acid side chains. The 
specificity of this pocket is determined by the identity of the amino acid at position 86 of 
25 the beta chain of the MHC Class n molecule. This site is located at the bottom of Pocket 

1 and determines the size of the side chain that can be accommodated by this pocket. 

i 

Marshall, K.W., J. Immunol, 152:4946-4956 (1994). If this residue is a glycine, then aU 
hydrophobic aliphatic and aromatic amino acids (hydrophobic aliphatics being: valine, 
leucine, isoleucine, methionine and aromatics being: phenylalanine, tyrosine and 
30 tryptophan) can be accommodated in the pocket, a preference being for the aromatic side 
. chains. If this pocket residue is a valine, then the side chain of this amino acid protrudes 
into the pocket and restricts the size of peptide side chains that can be acconmiodated 
such that only hydrophobic aliphatic side chains can be accommodated. Therefore, in an 
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amino acid residue sequence, wherever an amino acid with a hydrophobic aliphatic or 
aromatic side chain is found, there is the potential for a MHC Class II restricted T-cell 
epitope to be present. If the side-chain is hydrophobic aliphatic, however, it is 
approximately twice as hkely to be associated with a T-cell epitope than an aromatic side 
5 chain (assuming an approximately even distribution of Pocket 1 types throughout the 
global population). 

A computational method embodying the present invention profiles the likelihood of 
peptide regions to contain T-cell epitopes as follows: 

(1) The primary sequence of a peptide segment of predetermined length is scaimed, and 

10 aU hydrophobic aliphatic and aromatic side chains present are identified. (2)The 

hydrophobic aliphatic side chains are assigned a value greater than that for the aromatic 
side chains; preferably about twice the value assigned to the aromatic side chains, e.g., a 
value of 2 for a hydrophobic aliphatic side chain and a value of 1 for an aromatic side 
chain. (3) The values determined to be present are summed for each overlapping amino 

15 acid residue segment (window) of predetermined uniform length within the peptide, and 
the total value for a particular segment (window) is assigned to a single amino acid 
residue at an intermediate position of the segment (window), preferably to a residue at 
about the midpoint of the sampled segment (window). This procedure is repeated for 
each sampled overlapping amino acid residue segment (window). Thus, each amino acid 

20 residue of the peptide is assigned a value that relates to the likelihood of a T-cell epitope 
being present in that particular segment (window). (4) The values calculated and assigned 
as described in Step 3, above, can be plotted against the amino acid coordinates of the 
entire amino acid residue sequence being assessed. (5) All portions of the sequence which 
have a score of a predetermined value, e.g., a value of 1, are deemed likely to contain a T- 

25 cell epitope and can be modified, if desired. 

This particular aspect of the present invention provides a general method by which the 
regions of peptides likely to contain T-cell epitopes can be described. Modifications to the 
peptide in these regions have the potential to modify the MHC Class n binding 
characteristics. 

30 According to another aspect of the present invention, T-cell epitopes can be predicted 
with greater accuracy by the use of a more sophisticated computational method which 
takes into account the interactions of peptides with models of MHC Class II alleles. 
The computational prediction of T-cell epitopes present within a peptide according to this 
particular aspect contemplates the constmction of models of at least 42 MHC Class n 
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alleles based upon the structures of all known MHC Class H molecules and a method for 
the use of these models in the computational identification of T-cell epitopes, the 
constmction of libraries of peptide backbones for each model in order to allow for the 
known variability in relative peptide backbone alpha carbon (Ca) positions, the 
5 constmction of libraries of amino-acid side chain conformations for each backbone dock 
with each model for each of the 20 amino-acid alternatives at positions critical for the 
interaction between peptide and MHC Class n molecule, and the use of these libraries of 
backbones and side*chain conformations in conjunction with a scoring function to select 
the optimum backbone and side-chain confomiation for a particular peptide docked with a 
10 particular MHC Class n molecule and the derivation of a binding score from this 
interaction. 

Models of MHC Class IE molecules can be derived via homology modeling from a 
number of similar structures found in the Brookhaven Protein Data Bank ("PDB")- These 
may be made by the use of semi-automatic homology modeling software (Modeller, Sali 
15 A. & Blundell TL., 1993. J. Mol Biol 234:779-815) which incorporates a simulated 
annealing function, in conjunction with the CHARMm force-field for energy 
minimisation (available from Molecular Simulations Inc., San Diego, Ca.). Alternative 
modeling methods can be utilized as well. 

The present method differs significantly from other computational methods which use 
20 libraries of experimentally derived binding data of each amino-acid alternative at each 
position in the binding groove for a small set of MHC Class n molecules (Marshall, 
K.W., et al, Biomed. Pept. Proteins Nucleic Acids, 1(3): 157-162) (1995) or yet other 
computational methods which use similar experimental binding data in order to define die 
binding characteristics of particular types of binding pockets within the groove, again 
25 using a relatively small subset of MHC Class n molecules, and then 'mixing and 
matching' pocket types from this pocket library to artificially create further Virtual' 
MHC Class n molecules (Stumiolo T., et al., Nat. Biotech, 17(6): 555-561 (1999). Botii 
prior methods suffer the major disadvantage that, due to the complexity of the assays and 
the need to synthesize large numbers of peptide variants, only a small nimiber of MHC 
30 Class n molecules can be exp^mentally scanned. Therefore the first prior method can 
only make predictions for a smaU number of MHC Class n molecules. The second prior 
method also makes the assumption that a pocket lined with similar amino-acids in one 
. molecule will have the same bmding characteristics when in the context of a different 
Class n allele and suffers further disadvantages in that only those MHC Class II 
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molecules can be Virtually' created which contain pockets contained within the pocket 
library. Using the modeling approach described herein, the structure of any number and 
type of MHC Class n molecules can be deduced, therefore alleles can be specifically 
selected to be representative of the global population. In addition, the number of MHC 
5 Class n molecules scanned can be increased by making further models further than 
having to generate additional data via complex experimentation. 

The use of a backbone library allows for variation in the positions of the Ca atoms of the 
various peptides being scanned when docked with particular MHC Class 11 molecules. 
This is again in contrast to the alternative prior computational methods described above 

10 which rely on the use of simplified peptide backbones for scanning amino-acid binding in 
particular pockets. These simplified backbones are not likely to be representative of 
backbone conformations found in 'real' peptides leading to inaccuracies in prediction of 
peptide binding. The present backbone library is created by superposing the backbones of 
all peptides bound to MHC Class U molecules found within the Protein Data Bank and 

15 noting the root mean square (RMS) deviation between the Ca atoms of each of the eleven 
amino-acids located within the binding groove. While this library can be derived from a 
small number of suitable available mouse and human structures (currently 13), in order to 
allow for the possibility of even greater variability, the RMS figure for each C-a position 
is increased by 50%. The average Ca position of each amino-acid is then determined and 

20 a sphere drawn around this point whose radius equals the RMS deviation at that position 
plus 50%. This sphere represents all allowed Ca positions. 

Working from the Ca with the least RMS deviation (that of the amino-acid in Pocket 1 as 

mentioned above, equivalent to Position 2 of the 11 residues in the binding groove), the 

sphere is three-dimensionally gridded, and each vertex within the grid is then used as a 

25 possible location for a Ca of that amino-acid. The subsequent amide plane, corresponding 

to the peptide bond to the subsequent amino-acid is grafted onto each of these Cos and 

I 

the ^ and Xj; angles are rotated step-wise at set intervals in order to position the subsequent 
Ca. If the subsequent Ca falls within the 'sphere of allowed positions' for this Ca than 
the orientation of the dipeptide is accepted, whereas if it falls outside the sphere then the 
30 dipeptide is rejected. 

This process is then repeated for each of the subsequent Ca positions, such that the 
peptide grows from the Pocket 1 Ca 'seed', until all nine subsequent Cas have been 
positioned from all possible permutations of the preceding Cocs. The process is then 
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repeated once more for the single Ca preceding pocket 1 to create a library of backbone 
Ca positions located within the binding groove. 

The number of backbones generated is dependent upon several factors: The size of the 

'spheres of allowed positions' ; the fineness of the gridding of the 'primary sphere' at the 
5 Pocket 1 position; the fineness of the step-wise rotation of the ^ and iir angles used to 

position subsequent Cos. Using this process, a large library of backbones can be created. 

The larger the backbone library, the more likely it will be that the optimum fit will be 

found for a particular peptide within the binding groove of an MHC Class n molecule. 

Inasmuch as all backbones will not be suitable for docking with all the models of MHC 
10 Class n molecules due to clashes with amino-acids of the binding domains, for each allele 

a subset of the library is created comprising backbones which can be accommodated by 

that allele. 

The use of the backbone library, in conjunction with the models of MHC Class II 
molecules creates an exhaustive database consisting of allowed side chain conformations 

15 for each amino-acid in each position of the binding groove for each MHC Class II 

molecule docked with each allowed backbone. This data set is generated using a simple 
steric overlap function where a MHC Class II rnolecule is docked with a backbone and an 
amino-acid side chain is grafted onto the backbone at the desired position. Each of the 
rotatable bonds of the side chain is rotated step-wise at set intervals and the resultant 

20 positions of the atoms dependent upon that bond noted. The interaction of the atom with 
atoms of side-chains of the binding groove is noted and positions are either accepted or 
rejected according to the following criteria: The sum total of the overlap of all atoms so 
far positioned must not exceed a pre-determined value. Thus the stringency of the 
conformational search is a function of the interval used in the step- wise rotation of the 

25 bond and the pre-determined limit for the total overlap. This latter value can be small if it 
is known that a particular pocket is rigid, however the stringency can be relaxed if the 
positions of pocket side-chains are known to be relatively flexible. Thus allowances can 
be made to imitate variations in flexibility within pockets of the binding groove. This 
conformational search is then repeated for every amino-acid at every position of each 

30 backbone when docked with each of the MHC Class II molecules to create the exhaustive 
database of side-chain conformations. 

A suitable mathematical expression is used to estimate the energy of binding between 
models of MHC Class n molecules in conjunction with peptide ligand conformations 
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which have to be empirically derived by scanning the large database of backbone/side- 
chain confpnnations described above. Thus a protein is scanned for potential T-cell 
epitopes by subjecting each possible peptide of length varying between 9 and 20 amino- 
acids (although the length is kept constant for each scan) to the following computations: 

5 An MHC Class II molecule is selected together with a peptide backbone allowed for that 
molecule and the side-chains corresponding to the desired peptide sequence are grafted 
on. Atom identity and interatomic distance data relating to a particular side-chain at a 
particular position on the backbone are collected for each allowed conformation of that 
amino-acid (obtained from the database described above). This is repeated for each side- 

10 chain along the backbone and peptide scores derived using a scoring function. The best 
score for that backbone is retained and the process repeated for each allowed backbone 
for the selected model. The scores from aU allowed backbones are compared and the 
highest score is deemed to be the peptide score for the desired peptide in that MHC Class 
n model. This process is then repeated for each model with every possible peptide 

15 derived from the protein being scanned, and the scores for peptides versus models are 
displayed. 

In the context of the present invention, each ligand presented for the binding affinity 
calculation is an amino-acid segment selected from a peptide or protein as discussed 
above. Thus, the ligand is a selected stretch of amino acids about 9 to 20 amino acids in 

20 length derived from a peptide, polypeptide or protein of known sequence. The terms 
"amino acids" and "residues" are hereinafter regarded as equivalent terms. 
The ligand, in the form of the consecutive amino acids of the peptide to be examined 
grafted onto a backbone from the backbone library, is positioned in the binding cleft of an 
MHC Class n molecule from the MHC Class n molecule model library via the 

25 coordinates of the C'-C} atoms of the peptide backbone and an allowed conformation for 
each side-chain is selected from the database of allowed conformations. The relevant 
atom identities and interatomic distances are also retrieved from this database and used to 
calculate the peptide binding score. ligands with a high binding affinity for the MHC 
Class n binding pocket are flagged as candidates for site-directed mutagenesis, Aonino- 

30 acid substitutions are made in the flagged ligand (and hence in the protein of interest) 
which is then retested using the scoring function in order to determine changes which 
reduce the binding affinity below a predetermined threshold value. These changes can 
then be incorporated into the protein of interest to remove T-cell epitopes. 
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Binding between the peptide ligand and the binding groove of MHC Class n naolecules 
involves non-covalent interactions including, but not lindted to: hydrogen bonds, 
electrostatic interactions, hydrophobic (lipophilic) interactions and Van der Walls 
interactions. These are included in the peptide scoring function as described in detail 
5 below. 

It should be understood that a hydrogen bond is a non-covalent bond which can be formed 
between polar or charged groups and consists of a hydrogen atom shared by two other 
atoms. The hydrogen of the hydrogen donor has a positive charge where the hydrogen 
acceptor has a partial negative charge. For the purposes of peptide/protein interactions, 
10 hydrogen bond donors may be either nitrogens with hydrogen attached or hydrogens 
attached to oxygen or nitrogen. Hydrogen bond acceptor atoms may be oxygens not 
attached to hydrogen, nitrogens with no hydrogens attached and one or two connections, 
or sulphurs with only one connection. Certain atoms, such as oxygens attached to 
hydrogens or imine nitrogens (e.g. C=NEO may be both hydrogen acceptors or donors. 
Hydrogen bond energies range from 3 to 7 Kcal/mol and are much stronger than Van der 
Waal's bonds, but weaker than covalent bonds. Hydrogen bonds are also highly 
directional and are at their strongest when the donor atom, hydrogen atom and acceptor 
atom are co-linear. 

Electrostatic bonds are formed between oppositely charged ion pairs and the strength of 
the interaction is inversely proportional to the square of the distance between the atoms 
according to Coulomb's law. The optimal distance between ion pairs is about 2.8A. In 
protein/peptide interactions, electrostatic bonds may be formed between arginine, 
histidine or lysine and aspartate or glutamate. The strength of the bond will depend upon 
the pKa of the ionizing group and the dielectric constant of the medium although they are 
approximately similar in strength to hydrogen bonds. 

Lipophilic interactions are favorable hydrophobic-hydrophobic contacts that occur 
between he protein and peptide ligand. Usually, these will occur between hydrophobic 
amino acid side chains of the peptide buried within the pockets of the binding groove 
such that they are not exposed to solvent. Exposure of the hydrophobic residues to solvent 
is highly unfavorable since the surrounding solvent molecules are forced to hydrogen 
bond with each other forming cage-like clathrate stractures. The resultant decrease in 
entropy is highly unfavorable. Lipophilic atoms may be sulphurs which are neither polar 
nor hydrogen acceptors and carbon atoms which are not polar. 
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Van der Waal's bonds are non-specific forces found between atoms which are 3-4A apart. 
They are weaker and less specific than hydrogen and electrostatic bonds. The distribution 
of electronic charge around an atom changes with time and, at any instant, the charge 
distribution is not symmetric. This transient asymmetry in electronic charge induces a 

5 similar asymmetry in neighboring atoms. The resultant attractive forces between atoms 
reaches a liaaximum at the Van der Waal's contact distance but diminishes very rapidly at 
about 1 A to about 2k. Conversely, as atoms become separated by less than the contact 
distance, increasingly strong repulsive forces become dominant as the outer electron 
clouds of the atoms overlap. Although the attractive forces are relatively weak compared 

10 to electrostatic and hydrogen bonds (about 0.6 Kcal/mol), the repulsive forces in 
particular may be very important in determining whether a peptide ligand may bind 
successfully to a protein. 

In one embodiment, the B6hm scoring function (SCOREl approach) is used to estimate 
the binding constant. (Bohm, H.J., J. Comput Aided Mol Des., 8(3):243-256 (1994) 

15 which is hereby incorporated in its entirety). In another embodiment, the scoring function 
(SCORE2 approach) is used to estimate the binding affinities as an indicator of a ligand 
containing a T-cell epitope (Bohm, HJ., J. Comput Aided Mol Des., 12(4):309-323 
(1998) which is hereby incorporated in its entirety). However, the Bohm scoring 
functions as described in the above references are used to estimate the binding affinity of 

20 a ligand to a pirotein where it is already known that the ligand successfully binds to the 
protein and the protein/ligand complex has had its structure solved, the solved stracture 
being present in the Protein Data Bank CTDB"). Therefore, the scoring function has 
been developed with the benefit of known positive binding data. In order to allow for 
discrimination between positive and negative binders, a repulsion term must be added to 

25 the equation. In addition, a more satisfactory estimate of binding energy is achieved by 
computing the lipophilic interactions in a pairwise manner rather than using the area 
based energy term of the above Bohm functions. 

Therefore, in a preferred embodiment, the binding energy is estimated using a modified 
Bohm scoring function. In the modified Bohm scoring function, the binding energy 
30 between protein and ligand (AGbind) is estimated considering the following parameters: 
The reduction of binding energy due to the overall loss of translational and rotational 
entropy of the ligand (AGo); contributions from ideal hydrogen bonds (AGhb) where at 
least one partner is neutral; contributions from unperturbed ionic interactions (AQonic); 
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lipophilic interactions between lipophilic ligand atoms and lipophilic acceptor atoms 
(AGupo); the loss of binding energy due to the freezing of internal degrees of freedom in 
the ligand, i.e., the freedom of rotation about each C-C bond is reduced (AGrot); the 
energy of the interaction between the protein and ligand (Evdw). Consideration of these 

5 terms gives equation 1 : 

(AGbmd)=( AGo)+( AGhbxNhbM AGiomcXNioiiic)+( AGHpoxNupo)+( AGrot+Nrot)+(E vdw)- 
Where N is the number of qualifying interactions for a specific term and, in one 
embodiment, AGo, AQib, AGionic, AGupo and AGrot are constants which are given the 
values: 5,4, -4.7, -4.7, -0,17, and 1,4, respectively. 

10 The term Nhb is calculated according to equation 2 : 

Nhb = Zh-bondsf(AR, Aa) X f(Nneighb) X 

fpcs 

f(AR, Aa) is a penalty function which accounts for large deviations of hydrogen bonds 
from ideality and is calculated according to equation 3 : 
f(AR, A-D) = f 1(AR) X f2(Aa) 



15 Where: 


fl(AR) 


= lif AR<=TOL 




or 


= 1 - (AR - TOL)/0.4 if AR <= 0.4 + TOL 




or 


= Oif AR>0.4 + TOL 


And: 


f2(Aa) 


= 1 if Aa<30° 




or 


= l-( Aa - 30)/50 if Aa <=80° 


20 


or 


= Oif Aa>80° 



TOL is the tolerated deviation in hydrogen bond length = 0.25 A 

AR is the deviation of the H-O/N hydrogen bond length from the ideal value = 

Aa is the deviation of tiie hydrogen bond angle Z n/o-h-.o/n from its idealized value of 

180^ 

25 f(Nneigbb) distinguishes between concave and convex parts of a protein surface and 

therefore assigns greater weight to polar interactions found in pockets rather than those 
found at the protein surface. This function is calculated according to equation 4 below: 
f(Nneighb) = (Nncigiib/Nncighb.o) " where a = 0.5 

Nndghb is the number of non-hydrogen protein atoms that are closer than 5 A to any given 
30 protein atom. 

Nncighb.o is a constant = 25 
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fpcs is a function which allows for the polar contact surface area per hydrogen bond and 
therefore distinguishes between strong and weak hydrogen bonds and its value is 
determined according to the following criteria: 
fp^^= 6 when Apoiai/NHs < 10 
5 or fpcs= 1 when Apoiai/NnB > 10 

Apoiar is the size of the polar protein-ligand contact surface 
Nhb is the number of hydrogen bonds 
B is a constant whose value = 1,2 

For the implementation of the modified B5hm scoring function, the contributions from 
10 ionic interactions, AGiomc, are computed in a similar fashion to those from hydrogen 
bonds described above since the same geometry dependency is assumed. 
The term Nupo is calculated according to equation 5 below: 
Nupo = ZiLf(riL) 

f(riL) is calculated for all lipophilic ligand atoms, 1, and all lipophilic protein atoms, L, 
15 according to the following criteria: 

f(riL) =1 when Til <= RlffruJ =(riL - R1)/(R2-R1) when R2 <riL > Rl 

f(riL) =0 when rrL>=R2 

Where: Rl=ri"''" + rJ^^ + 0,5 

andR2 = Rl + 3,0 
20 and ri^^* is flie Van der Waal's radius of atom 1 

and tl^^^ is the Van der Waal's radius of atom L 

The term Nrot is the number of rotable bonds of the amino acid side chain and is taken to 

be the number of acyclic sp^ - sp^ and sp^ - sp^ bonds. Rotations of terminal -CH3 or - 

NH3 are not taken into account. 
25 The final term, Evdw, is calculated according to equation 6 below: 

Evdw = ei82((ri^'" +r2^^^)' V - (n^^" +r2^'")'/r^), where: 

El and 82 are constants dependant upon atom identity 

j.^vdw ^^^vdw ^ Waal's atomic radii 

r is the distance between a pair of atoms. 
30 With regard to Equation 6, in one embodiment, the constants ei and 82 are given the atom 

values: C: 0.245, N: 0.283, 0: 0.316, S: 0.316, respectively (i.e. for atoms of Carbon, 

Nitrogen, Oxygen and Sulphur, respectively). With regards to equations 5 and 6, tiie Van 

der Waal's radii ^e given tiie atom values C: 1.85, N: 1.75, O: 1.60, S: 2.OOA. 
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It should be understood that all predetenzuned values and constants given in the equations 
above are determined within the constraints of current understandings of protein ligand 
interactions with particular regard to the type of computation being undertaken herein. 
Therefore, it is possible that, as this scoring function is refined further, these values and 
5 constants may change hence any suitable numerical value which gives the desired results 
in terms of estimating the binding energy of a protein to a ligand may be used and hence 
fall within the scope of the present invention. 

As described above, the scoring function is applied to data extracted from the database of 
side-chain conformations, atom identities, and interatomic distances. For the purposes of 

10 the present description, the number of MHC Class n molecules included in this database 
is 42 models plus four solved structures. It should be apparent from the above 
descriptions that the modular nature of the construction of the computational method of 
the present invention means that new models can simply be added and scanned with the 
peptide backbone library and side-chain conformational search function to create 

15 additional data sets which can be processed by the peptide scoring function as described 
above. This allows for the repertoire of scanned MHC Class H molecules to easily be 
increased, or structures and associated data to be replaced if data are available to create 
more accurate models of the existing alleles. 

The present prediction method can be calibrated against a data set comprising a large 
20 number of peptides whose affinity for various MHC Class II molecules has previously 
been experimentally determined. By comparison of calculated versus experimental data, 
a cut of value can be determined above which it is known that all experimentally 
determined T-cell epitopes are correctly predicted. 

It should be understood that, although the above scoring function is relatively simple 
25 compared to some sophisticated methodologies that are available, the calculations are 
performed extremely rapidly. It should also be understood that the objective is not to 
calculate the true binding energy per se for each peptide docked in the binding groove of 
a selected MHC Class II protein. The underlying objective is to obtain comparative 
binding energy data as an aid to predicting the location of T-cell epitopes based on the 
30 primary structure (i.e. amino acid sequence) of a selected protein. A relatively high 
binding energy or a binding energy above a selected threshold value would suggest the 
presence of a T-cell epitope in the ligand. The ligand may then be subjected to at least 
one round of amino-acid substitution and the binding energy recalculated. Due to the 
rapid nature of the calculations, these manipulations of the peptide sequence can be 
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performed interactively within the program* s user interface on cost-effectively available 
computer hardware. Major investment in computer hardware is thus not required. 
It would be apparent to one skilled in the art that other available software could be used 
for the same purposes. In particular, more sophisticated software which is capable of 

5 docking ligands into protein binding-sites may be used in conjunction with energy 
minimization. Examples of docMng software are: DOCK (Kuntz et al, J. Mol BioU 
161:269-288 (1982)), LUDI (Bohm, H.J., J. Comput Aided Mol Des., 8:623-632 (1994)) 
and FLEXX (Rarey M.. et al, ISMB, 3:300-308 (1995)). Examples of molecular 
modeling and manipulation software include: AMBER (Tripos) and CHARMm 

10 (Molecular Simulations Inc.). The use of these computational methods would severely 
limit the throu^put of the method of this invention due to the lengths of processing time 
required to make the necessary calculations. However, it is feasible that such methods 
could be used as a *secondary screen' to obtain more accurate calculations of binding 
energy for peptides which are found to be •positive binders' via the method of the present 

15 invention. 

The limitation of processing time for sophisticated molecular mechanic or molecular 
dynamic calculations is one which is defined both by the design of the software which 
makes these calculations and the current technology limitations of computer hardware. It 
may be anticipated that, in the future, with the writing of more efficient code and the 
20 continuing increases in speed of computer processors, it may become feasible to make 
such calculations within a more manageable time-frame. 

Further information on energy functions applied to macromolecules and consideration of 
the various interactions that take place within a folded protein structure can be found in: 
Brooks, BJR., et at, Comput. Chem., 4:187-217 (1983) and further information 
25 concerning general protein-ligand interactions can be found in: Dauber-Osguthorpe et al., 
Prarem^4(l):3 1-47(1988), which are incorporated herein by reference in their entirety. 
Useful background information can also be found, for example, in Fasman, G.D., ed.. 
Prediction of Protein Structure and the Principles of Protein Conformation, Plenum 
Press, New York, ISBN: 0-306 4313-9, 

30 

EXAMPLE 2 

Method of mapping epitopes in bryodin 1 using naXve human T-cell proliferation assays: 
The interaction between MHC, peptide and T-cell receptor (TCR) provides the stmctural 
basis for the antigen specificity of T-cell recognition. T-cell proliferation assays test the 
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binding of peptides to MHC and the recognition of MHC/peptide complexes by the TCR. 
In vitro T-cell proliferation assays of the present example, involve the stimulation of 
peripheral blood mononuclear cells (PBMCs), containing antigen presenting cells (APCs) 
and T-cells. Stimulation is conducted in vitro using synthetic peptide antigens, and in 
5 some experiments whole protein antigen. Stimulated T-cell proliferation is measured 
using ^H-thymidine (^H-Thy) and the presence of incorporated ^-Thy assessed using 
scintillation counting of washed fixed cells. 

Buffy coats from human blood stored for less than 12 hours were obtained from the 
10 National Blood Service (Addenbrooks Hospital, Cambridge, UK). Ficoll-paque was 
obtained from Amersham Pharmacia Biotech (Amersham, UK). Serum free AIM V 
media for the culture of primary human lymphocytes and containing L-glutamine, 
50jxg/ml streptomycin, 10|Lig/ml gentomycin and 0.1% human serum albumin was from 
Gibco-BRL (Paisley, UK), Synthetic peptides were obtained from Eurosequence 
15 (Groningen, The Netherlands) and Babraham Technix (Cambridge, UK). 

Erythrocytes and leukocytes were separated from plasma and platelets by gentle 
centrifrigation of buffy cdats. The top phase (containing plasma and platelets) was 
removed and discarded. Erythrocytes and leukocytes were diluted 1:1 in phosphate 

20 buffered saline (PBS) before layering onto ISrol ficoll-paque (Amersham Pharmacia, 
Amersham UK). Centrifugation was done according to the manufacturers recommended 
conditions and PBMCs were harvested fix>m the serum+PBS/ficoU paque interface. 
PBMCs were mixed with PBS (1:1) and collected by centrifugation. The supernatant was 
removed and discarded and the PBMC pellet resuspended in 50ml PBS. . Cells were again 

25 pelleted by centrifugation and the PBS supematant diiscarded. Cells were resuspended 
using 50ml AIM V media and at this point counted and viability assessed using trypan 
blue dye exclusion. Cells were again collected by centrifugation and the supematant 
discarded. Cells were resuspended for cryogenic storage at a density of 3x10^ per ml» 
The storage medium was 90%(v/v) heat inactivated AB human serum (Sigma, Poole, UK) 

30 and 10%(v/v) DMSO (Sigma, Poole, UK). Cells were transferred to a regulated freezing 
container (Sigma) and placed at -70°C overnight When required for use, cells were 
thawed rapidly in a water bath at 37°C before transferring to 10ml pre-warmed AIM V 
medium. 
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PBMC were stimulated with protein and peptide antigens in a 96 well flat bottom plate at 
a density of 2x10^ PBMC per well. PBMC were incubated for 7 days at 37°C before 
pulsing with ^H-Thy (Amersham-Phamacia, Amersham, UK). For the present study, 
synthetic peptides (ISmers) that overlapped by 12 amino acids were generated that 
5 spanned the entire sequence of bryodin 1. Peptide identification numbers (ID#) and 
sequences are given in FIGURE 2. Each peptide was screened individually against 
PBMC's isolated firom 21 nai've donors. Two control peptides that have previously been 
shown to be immunogenic and a potent non-recall antigen KLH were used in each donor 
assay. 

10 The control antigens used in this study were as below: 



Peptide 


Sequence 


C-32 


Biotin-PKYVKQNTLKLAT 
Flu haemagglutinin 307-319 


C-49 


KWDQIKKISKPVQH 
Chlamydia HSP 60 peptide 


KLH 


Whole protein from Keyhole 
Limpet Hieniocyanin. 



Peptides were dissolved in DMSO to a final concentration of lOmM, these stock solutions 
were then diluted 1/500 in AIM V media (final concentration 20\xM). Peptides were 

15 added to a flat bottom 96 well plate to give a final concentration of 2 and 20pM in a 

100|Lil. The viability of thawed PBMC's was assessed by trypan blue dye exclusion, cells 
were then resuspended at a density of 2x10^ cells/ml, and 100|Lil (2x10^ PBMC/well) was 
transferred to each well containing peptides. Triplicate well cultures were assayed at each 
peptide concentration. Plates were incubated for 7 days in a humidified atmosphere of 

20 5% CO2 at 3TC. Cells were pulsed for 18-21 hours with IfiCi ^-Thy/well before 

harvesting onto filter mats. CPM values were determined using a Wallac n[iicroplate beta 
top plate counter (Perkin Ehner). Results were expressed as stimulation indices, derived 
by division of the proliferation score (e.g. counts per minute of radioactivity) measured to 
the test peptide by the score measured in cells not contacted with a test peptide. 

25 
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The present studies have uncovered sonie 32 peptide sequences able to evoke a significant 
proliferative response (i.e. SI>2.0) in T-cells derived from at least one donor. Within this 
set of peptides, a further sub-set of peptides have been identified which evoke a 
significant proliferative response in 2 or more individual donor samples and for some of 

5 theses responses the niagnitude of response has indeed been significantly higher than 
SI=2.0. Mapping T cell epitopes in the bryodin 1 sequence using the T cell proliferation 
assay resulted in the identification of five major immunogenic regions encompassed by 
peptide ID#16-18, 30, 38-41, 46-50 and 60-64. For each of these peptides, PBMCs 
prepared from 2 or more donor samples showed a stimulation index >2.0. FIGURE 5 

10 panels A - E show representative histograms of SI responses to individual peptides in 
selected PBMC donor samples. Collectively the panels have been selected to demonstrate 
examples of positive responses to peptides from each of the epitope regions Rl - R5. 
The tissue types for all PBMC samples were assayed using a commercially available 
reagent system (Dynal, Wirral, UK). Assays were conducted in accordance with the 

15 suppliers recommended protocols and standard ancillary reagents and agarose 

electrophoresis systems. Allotypic coverage forDRBl alleles was 70% in the 20 donors 
tested. Of the 21 different PBMC donor preparations 10 were reactive to peptides 
encompassed within epitope regions Rl - R5. The allotypic specificities of each of the 
responsive donor samples is given in TABLE 1. 

20 

Table 1 



MHC Allotypes of responsive donor samples 



■ Donca:# 


" -lilHC Allotype ; V .-^ . 


4 


DRB1*03, DRB1*04. DRB3, DKB4*01 


5 


DRB1*07, DRB1*09, DRB4*01 


6 


DRB1*13, DRB1*15, DRB3, DRB5 


9 


DRB1<*04, DRB1*12, PRB3, DRB4*01 


12 


DRB1*07, DRB1*13, DRB3, DRB4*01 


14 


. DRB1*04,DRB4*01 


15 


DRB1*03, DRB 1*14, DRB3 


17 


DRB1*12, DRB 1*15, DRB3, DRB5 


19 


DRB 1*03, DRB 1*07, PRB3, DRB4*01 


21 


DRB1*08. DRB1*14, DKB3 
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EXAMPLES 

Design of modified sequences with improved immunogenicity profiles: 

The method of EXAMPLE 1 was used in an analysis of the epitope regions Rl, R2, R3, 

R4 and R5. The system enables prediction of the particular MHC ligands encompassed 

5 within the biologically detected epitope regions and provides a "score" with respect to the 
ability of a given MHC class n ligand to interact with a particular MHC allotype. The 
allotypic restriction pattern for the MHC ligands can be depicted using the allotypic 
restriction chart displays as provided for each of the epitope regions R1-R5 in the 
accompanying FIGURES 6-10. 

10 The analysis was extended to consideration of sequence modifications within each of the 
epitopes Rl - R5. The sequence variants were tested for continued ability bind MHC 
class n and their binding scores where these remained. Multiple amino acid substitutions 
were defined which achieved elimination of MHC class n binding with the majority of 
MHC allotypes tested. The particular substitutions identified were further tested for their 

15 ability to be accommodated within the structural model of the bryodin molecule. 

Designed mutations on the selected residues of the wild type sequence were checked for 
steric clashes, hydrogen bonding formation, hydrophobic interactions and its general 
accommodation in the structure. Substitutions that gave rise to steric clashes were 
dismissed. Substitutions that were acconomodated when the side chain was adopting a 

20 similar configuration (rotamer) to the original residue was considered acceptable. If more 
than one substitutions fulfilled these criteria, residues that potentially form hydrogen 
bonds with neighbouring side chains or backbone atoms, and/or form favourable 
hydrophobic contacts or other associations were preferred. The above procedure was 
performed interactively using Swiss Prot Deep View v3.7 [Guex, N. and Peitsch, M.C. 

25 (1997) Electrophoresis 18: 2714-2723]. This process resulted in a prefecred substitution 
set for each of the epitope regions Rl- R5. The substitution sets were compiled to 
produce the stmcture depicted in FORMULA L All substitutions were confirmed to 
result in removal of the MHC class n ligands within each of the epitope regions Rl - R5. 
Substitutions featuring alternative amino acid residues in addition to the most preferred 

30 substitutions are given with the FORMULA 1. 

For epitope region R3 alternative substitution sets were designed enabling the option of 
leaving leucine 115 in the wild-type configuration. This residue it thought to be 
structurally important forming part of the substrate binding cleft for the bryodin 1 
enzyme. A preferred set of substitutions involving LI 15 comprises the changes LusA, 
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I122A, I126A L130A, L133F and I137A. An alternative set of substitutions which maintain 
L115 comprise AngT, G120H, K121S and R123T • These changes would be made in 
alternative to the dual changes LnsA and I122A. 



5 



